# Visual Language Model
Bespoke MiniChart 7B
A 7B-parameter open-source chart understanding vision-language model developed by Bespoke Labs, outperforming closed-source models like Gemini-1.5-Pro in chart QA tasks
Text-to-Image
Safetensors English
B
bespokelabs
437
12
Instancecap Captioner
Other
A visual language model fine-tuned on the instancevid dataset based on Qwen2.5-VL-7B-Instruct, specializing in instance-level image description generation
Image-to-Text
Transformers

I
AnonMegumi
14
1
Moondream Next
Pre-release version of moondream, primarily for internal testing.
Large Language Model
Transformers

M
vikhyatk
153
40
Cogvlm Grounding Generalist Hf
CogVLM is a powerful open-source visual language model (VLM) that has achieved SOTA performance on multiple cross-modal benchmarks.
Image-to-Text
Transformers

C
THUDM
702
16
Featured Recommended AI Models